Testing computer program accessibility for users with disabilities, such as for use with mobile phones

ABSTRACT

Disclosed here is a system to enable interaction between a user with a disability and a computer program. The system can obtain a representation of a user interface to present to a user. The system can determine an element associated with the user interface, where the element is configured to provide information to the user, however, the user interface presentation of the element at least partially fails to provide the information to the user. Based on the element, the system can determine an appropriate test to perform. The appropriate test indicates at least two of: a test to perform with a keyboard, a gesture test to perform with a mobile screen reader, and an audio test to perform with a screen reader. The system can generate an indication of the appropriate test. The system can provide the indication of the appropriate test prior to releasing the user interface to the user.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation of U.S. patent application Ser. No.17/519,466, filed on Nov. 4, 2021, entitled ENSURING THAT COMPUTERPROGRAMS ARE ACCESSIBLE TO USERS WITH DISABILITIES, SUCH AS FOR USE WITHMOBILE PHONES, now U.S. Pat. No. 11,550,702, which is herebyincorporated by reference in its entirety.

BACKGROUND

The internet strives to operate as an inclusive place. However, thosewith disabilities may still have difficulty finding websites andinformation online because most user interfaces are targeted towardusers who can see, hear, and use a keyboard and mouse.

BRIEF DESCRIPTION OF THE DRAWINGS

Detailed descriptions of implementations of the present invention willbe described and explained through the use of the accompanying drawings.

FIG. 1 shows a user interface to generate an indication of anappropriate test to perform to ensure that a user interface providesintended information to a large set of users, including users withdisabilities.

FIG. 2 shows user interface categories associated with a web page.

FIG. 3 shows user interface categories associated with a nativeapplication.

FIGS. 4A-4B show selection of an appropriate test.

FIG. 5 is a flowchart that illustrates a process to generate anindication of an appropriate test to perform to ensure that a userinterface provides intended information to a user.

FIG. 6 is a block diagram that illustrates an example of a computersystem in which at least some operations described herein can beimplemented.

The technologies described herein will become more apparent to thoseskilled in the art from studying the Detailed Description in conjunctionwith the drawings. Embodiments or implementations describing aspects ofthe invention are illustrated by way of example, and the same referencescan indicate similar elements. While the drawings depict variousimplementations for the purpose of illustration, those skilled in theart will recognize that alternative implementations can be employedwithout departing from the principles of the present technologies.Accordingly, while specific implementations are shown in the drawings,the technology is amenable to various modifications.

DETAILED DESCRIPTION

Disclosed here is a system and method to allow users with a disabilityto interact with computer programs. To ensure that the computer programsare accessible to users with a disability, prior to releasing thecomputer program, the system generates an indication of an appropriatetest to perform to ensure that a user interface can provide intendedinformation even to a user with a disability. The system can obtain arepresentation of the user interface to present to a user. Therepresentation can be a design document describing the user interface, asketch of the user interface, or a functioning user interface. The userinterface can be a graphical user interface, an audio interface, or ahaptic user interface. The system can identify an element associatedwith the user interface, such as a menu, a button, a link, etc., wherethe element is configured to provide information to the user, however,the user interface presentation of the element at least partially failsto provide the information to the user, due to the user's particulardisability or severity of that disability.

Based on the element, the system can determine an appropriate test toperform, where the appropriate test indicates a test to perform with akeyboard, a gesture test to perform with a mobile screen reader (e.g.TalkBack or VoiceOver), and/or an audio test to perform with a screenreader. The system can generate the indication of the appropriate test.For example, when the appropriate test includes the test to perform withthe keyboard, the system can indicate a keyboard key and an effectcaused by activating the keyboard key. When the appropriate testincludes the gesture test, the system can indicate a gesture and aneffect caused by performing the gesture. When the appropriate testincludes the audio test, the system can indicate audio and a function tobe performed by the audio. The system can provide the presentation priorto releasing the user interface to the user, to ensure that theappropriate tests are performed.

A screen reader is a form of assistive technology (AT) that renders textand image content as speech or haptic output, e.g. braille output. Arefreshable braille display or braille terminal is an electro-mechanicaldevice for displaying characters, usually by means of round-tipped pinsraised through holes in a flat surface. Visually impaired computer userswho cannot use a standard computer monitor can use it to read textoutput. Deafblind computer users may also use refreshable brailledisplays. Speech synthesizers are also commonly used for the same task,and a blind user may switch between the two systems or use both at thesame time depending on circumstances.

Screen readers are essential to people who are blind, and are useful topeople who are visually impaired, illiterate, or have a learningdisability. Screen readers are software applications that attempt toconvey what people with normal eyesight see on a display to their usersvia non-visual means, like text-to-speech, sound icons, or a Brailledevice. They do this by applying a wide variety of techniques thatinclude, for example, interacting with dedicated accessibility APIs,using various operating system features (like inter-processcommunication and querying user interface properties), and employinghooking techniques.

In computer programming, the term hooking covers a range of techniquesused to alter or augment the behavior of an operating system, ofapplications, or of other software components by intercepting functioncalls or messages or events passed between software components. Codethat handles such intercepted function calls, events or messages iscalled a hook. Examples of hooking can include intercepting an outputfrom an application sent to the display, to add audio or Brailleindications of the output.

The description and associated drawings are illustrative examples andare not to be construed as limiting. This disclosure provides certaindetails for a thorough understanding and enabling description of theseexamples. One skilled in the relevant technology will understand,however, that the invention can be practiced without many of thesedetails. Likewise, one skilled in the relevant technology willunderstand that the invention can include well-known structures orfeatures that are not shown or described in detail, to avoidunnecessarily obscuring the descriptions of examples.

Ensuring That Computer Programs Are Accessible to Users with aDisability

FIG. 1 shows a user interface to generate an indication of anappropriate test to perform to ensure that a user interface providesintended information to a large set of users, including users withdisabilities. The user interface 100 enables a user to indicate the typeof tests to perform on a program, to ensure that a program 110 isaccessible to users with a disability. The users with a disability maynot be able to use the mouse or the keyboard, or they may not be able toread due to blindness, limited literacy, or other reasons.

The program 110 can include web programs, such as web pages 110A ornative applications 110B. The native applications 110B can be associatedwith various devices such as mobile phones, computers, tablets, wearabledevices, etc. User interfaces of programs 110A, 110B can includegraphical user interfaces, audio user interfaces, and/or haptic userinterfaces.

FIG. 2 shows user interface categories associated with the web page. Ahardware or software processor executing instructions described in thisapplication can determine the type of program associated with the userinterface, and based on the type of program can determine the multiplecategories associated with user interface elements. When the program isa web page 110A in FIG. 1 , the processor creates the followingcategories: HTML 200, header (or banner) 210, navigation 220, main 230,form 240, and footer (contentinfo) 250. Other categories can be includedsuch as aside, article, section. The aside HTML element represents aportion of a document whose content is only indirectly related to thedocument's main content. Asides are frequently presented as sidebars orcall-out boxes. The article HTML element represents a self-containedcomposition in a document, page, application, or site, which is intendedto be independently distributable or reusable (e.g., in syndication).Examples include: a forum post, a magazine or newspaper article, or ablog entry, a product card, a user-submitted comment, an interactivewidget or gadget, or any other independent item of content. The sectionelement is a structural HTML element used to group together relatedelements. Each section typically includes one or more heading elementsand additional elements presenting related content.

The HTML 200 element indicates that the document is an HTML document.HyperText Markup Language, or HTML, is the standard markup language fordocuments designed to be displayed in a web browser. The HTML 200element can include user interface elements such as basic web page 201,skip link 202, header/banner 203, navigation menu 204, main landmark205, and/or footer/content info 206.

The header 210 is a content sectioning element allowing organization ofthe document content into logical pieces. Header 210 representsintroductory content, typically a group of introductory or navigationalaids. Header 210 may contain some heading elements but also a logo, asearch form, an author name, and other elements. The header 210 elementcan include user interface elements such as search input, header/banner,and/or navigation menu (not pictured).

The navigation 220 element represents a section of a page that servesthe purpose of providing navigation links, either within the currentdocument or to other documents. Common examples of navigation sectionsare menus, tables of contents, and indexes. The navigation 220 elementcan include user interface elements such as search input 221, paginationnav 222, button 223, link 224, and/or navigation menu 225.

The main 230 element represents the dominant content of the <body>of adocument. The main content area consists of content that is directlyrelated to or expands upon the central topic of a document, or thecentral functionality of an application. The main 230 element caninclude user interface elements such as: Alert; Animation; Expanderaccordion; Figure: map, chart, table; Heading: h1, h2, h3; Image: jpg,gif, png, svg; Modal Dialog; Separator/horizontal rule; Sticky element;Table; Toast Snackbar; Tooltip; Video/audio player; Pagination nav;Progress bar; Link; Carousel; and/or Main landmark (not pictured).

The form 240 element represents a document section containinginteractive controls for submitting information. The form 240 elementcan include user interface elements such as: Alert; Checkbox; Datepicker dialog; Dropdown listbox select; Hint, help, or error; Listboxwith inline autocomplete; Number input; Password input; Radio button;Range slider input; Separator/horizontal rule; Star rating; Stepperinput; Text input; Toast Snackbar; Toggle switch; Progress bar; and/orButton (not pictured).

The footer 250 element is a content sectioning element. The footer 250element represents a footer for its nearest sectioning content orsectioning root element. A <footer>typically contains information aboutthe author of the section, copyright data, or links to relateddocuments. The footer 250 element can include user interface elementssuch as navigation menu and/or footer/content info (not pictured).

FIG. 3 shows user interface categories associated with a nativeapplication. A processor based on the type of program can determine themultiple categories associated with user interface elements. When theprogram is a native application 110B in FIG. 1 , the processor createsthe following categories: controls 300 and notifications 310. Theprocessor chooses the categories so that user interface elementsbelonging to native applications running on various platforms such asAndroid and iOS can be categorized in the categories 300, 310. Theprocessor can also create category specific to a particular platformsuch as iOS. Those categories can include bars and views.

Controls 300 are user interface elements that display content or enableinteraction. Controls are the building blocks of the user interface.Controls 300 can include the following user elements: Button; Captcha;Carousel; Checkbox; Link; Menu; Pagination control;Picker/Spinner/Dropdown; Radio button; Range slider; SegmentedControl/Tab; Stepper; Table row button; Text input; and/or Toggleswitch.

The notifications 310 include Alert/Modal Dialog and/or Toast/snackbar/banner notification. Alert/Modal Dialog notifications interruptusers and demand an action. They are appropriate when a user's attentionneeds to be directed toward important information. Toast, snackbar,banner all refer to a graphical control element that communicatescertain events to the user without forcing the user to react to thisnotification immediately, unlike conventional pop-up windows. Desktopnotifications usually disappear automatically after a short amount oftime. Often their content is then stored in some widget that allows theusers to access past notifications at a more convenient time.

A bar can include a navigation bar, search bar, sidebar, status bar, tabbar, toolbar, etc. The bar can provide various information to the useror receive input from the user. The bar can be vertical, or horizontalon the screen.

A view represents a single item on the user interface. It could be acontrol such as a button or slider, an input text field, or perhaps animage. A view covers a specific area on the screen and can be configuredto respond to touch events.

FIGS. 4A-4B show selection of an appropriate test. When the appropriateuser interface element 400 shown in FIGS. 4A-4B, 410 shown in FIG. 4B(only 2 labeled for brevity) is selected, the processor can generate thepresentation 460 including the indication 420 of the appropriate test.The processor can select the user interface elements 400, 410 byanalyzing the user interface and determining the user interface elements400, 410 included in the user interface. Alternatively, the user canmanually select the user interface elements 400, 410.

The indication 420 of the appropriate test can be split into 3categories: keyboard test 430A, gesture test 440A, and audio test 450A.

When the appropriate test includes the keyboard test 430A, the testindicates a keyboard key and an effect caused by activating the keyboardkey. For example, for testing the user interface element checkbox, thetest indicates:

-   1. Test with the keyboard    -   Tab or arrow keys: Focus visibly moves to the checkbox    -   Spacebar: Activates on iOS and Android    -   Enter: Activates on Android.

The tests 430A, 440A, 450A can indicate keys to test specific operatingsystems and expected effects that depend on the operating system.

When the appropriate test includes the gesture test 440A, the testindicates a gesture and an effect caused by performing the gesture. Forexample, for testing the user interface element checkbox, the testindicates:

-   Swipe: Focus moves to the element, expresses its name, role, state-   Doubletap: Checkbox toggles between checked and unchecked states.

When the appropriate test includes the audio test 450A, the testindicates audio and a function to be performed by the audio. Forexample, for testing the user interface element checkbox, the testindicates that the audio should include:

-   Name: Describes the purpose of the control and matches the visible    label-   Role: Identifies itself as a checkbox in Android and a Button in iOS-   Group: Visible label is grouped with the checkbox in a single swipe-   State: Expresses its state (disabled/dimmed, checked, not checked).

For both native applications and web programs, the group indicates tocheck that labels and controls are a single object to the user. Forexample, if the user element is a checkbox, tapping on the checkboxlabel should activate the checkbox rather than being forced to tap onthe tiny checkbox.

The presentation 460 can be made human readable format or can be in amachine readable format because the testing can be performedautomatically or manually. The indication 420 of the appropriate testcan be editable to allow modification to the indicated tests 430A, 440A,450A.

FIG. 4B shows selection of multiple appropriate tests. In FIG. 4B, boththe checkbox 400 and the menu 410 have been selected. The processorappends the indication 420, 470 of the appropriate tests to create thepresentation 460.

For testing the user interface element “menu,” the keyboard test 430Bindicates:

-   1. Test with the keyboard    -   Tab or arrow keys: Focus visibly moves, confined within the menu    -   Escape: The menu closes and returns focus to the button that        launched it    -   Space: Any buttons or links are activated on iOS and Android    -   Enter: Any buttons or links are activated on Android.

When the appropriate test includes the gesture test 440B, the testindicates a gesture and an effect caused by performing the gesture. Fortesting the user interface element “menu,” the gesture test 440Bindicates:

-   Swipe: Focus moves, confined within the menu.-   Doubletap: This typically activates most elements.

The gesture test 440B tests traversing and interacting with the screenfor people who are blind and cannot see the screen. The gesture test cantest that swiping across the screen moves the screenreader “focus” todifferent elements one by one.

When the appropriate test includes the audio test 450B, the testindicates audio and a function to be performed by the audio. Forexample, for testing the user interface element “menu,” the audio test450B indicates that the audio should include:

-   Name: Purpose of menu is clear.-   Role: May identify itself as a menu, sidebar, or panel. Confining    the user within the menu communicates the context to the screen    reader user that there is a menu or modal present.-   State: When open, other content is inert. Expands/collapses,    closes/opens states are typically announced for a menu, sidebar, or    panel.

The tests 430, 440, 450 can be written in a human readable format,and/or in a machine-readable format. The tests 430, 440, 450 can beperformed manually or automatically. To perform an automated test, theprocessor can receive the indication of the appropriate tests 430, 440,450 in either human readable format or in a machine readable format.Based on the indication of the appropriate tests 430, 440, 450, theprocessor can determine which tests to perform.

For example, if the test indicates an audio test 450A, 450B, theprocessor can execute the program under test, and the audio test for theparticular element under test. In addition, the processor can play theaudio associated with the particular element and perform naturallanguage processing on the audio to determine the content of the audioand whether the content of the audio matches the particular elementunder test.

For example, if the element is a checkbox, the processor can determinewhether the audio corresponds to the output specified in the “Name,”“Role,” “Group,” and “State” requirements of the audio test 450A. Inparticular, to pass the “Name” component of the audio test 450A, theprocessor can determine whether the audio contains the specified output,namely whether the audio correctly states the name of the checkbox. Topass the “Role” component of the audio test 450A, the processor candetermine whether the audio contains the specified output, namelywhether the audio correctly states that the element under test is acheckbox in Android, or a button in iOS. To pass the “Group” componentof the audio test 450A, the processor can determine whether the audiocontains the specified output, namely whether the audio correctly statesthat the name of the element under test is grouped with the checkbox. Topass the “Group” component of the audio test 450A, the processor canalso determine that tapping on the checkbox label activates the checkboxrather than being forced to tap on the tiny checkbox. Label and checkboxshould act as a single unit. Finally, to pass the “State” component ofthe audio test 450A, the processor can determine whether the audiocontains the specified output, namely whether the audio correctly statesif the element under test is checked or unchecked. If the audio passesall four of the tests, the processor can determine that the programsatisfies the audio test 450A. If the audio does not pass all four ofthe tests, the processor can indicate that the program has failed theaudio test.

FIG. 5 is a flowchart that illustrates a process to generate anindication of an appropriate test to perform to ensure that a userinterface provides intended information to a user. In step 500, ahardware or software processor executing instructions described in thisapplication can obtain a representation of the user interface to presentto a user. The representation of the user interface can be a designdescribing the user interface, and/or a functioning user interface. Theuser interface can include a user interface, an audio user interface,and/or a haptic user interface.

In step 510, the processor can determine an element associated with theuser interface such as a radio button, text input, capture, etc. Theelement is configured to provide information to the user, however, theuser interface presentation of the element at least partially fails toprovide the information to the user. For example, the user can bedisabled and not be able to see and may need to interact with the userinterface using audio or gestures when appropriate. The user may not beable to use the mouse and may need to interact with the user interfaceusing a keyboard.

In step 520, based on the element, the processor can determine anappropriate test to perform. To accommodate various disabilities, theappropriate test can indicate a test to perform with a keyboard, agesture test to perform with a mobile screen reader, and an audio testto perform with a screen reader.

In step 530, the processor can generate the indication of theappropriate test categorized by the type of test. When the appropriatetest includes the test to perform with the keyboard, the appropriatetest indication can include a keyboard key and an effect caused byactivating the keyboard key. When the appropriate test includes thegesture test, the appropriate test indication can include a gesture andan effect caused by performing the gesture. When the appropriate testincludes the audio test, the appropriate test indication can includeaudio and a function to be performed by the audio. The processor canprovide the presentation prior to releasing the user interface to theuser.

In step 540, the processor can provide the indication of the appropriatetest prior to releasing the user interface to the user. The indicationcan be a presentation such as a multimedia presentation, textpresentation, an audio presentation, a multimedia presentation, etc. Theindication can be editable. Prior to releasing the user interface to theuser, the processor can perform the appropriate test is performed on theuser interface and can indicate when the user interface passed theappropriate test.

The processor can determine a type of program associated with the userinterface. Based on the type of program, the processor can determinemultiple categories associated with the user interface. The processorcan determine a type of element by categorizing the element associatedwith the user interface into a category among the multiple categories.Finally, the processor can enable a selection of the appropriate test bypresenting the multiple categories, the element, and the categoryassociated with the element.

The user interface can be a web page. When the user interface is a webpage, the processor can determine multiple categories associated withthe user interface including at least four of: HTML, header, navigation,main, form, and footer. The processor can categorize the elementassociated with the user interface into a category among the multiplecategories. The processor can enable a selection of the appropriate testby presenting the multiple categories, the element, and the categoryassociated with the element. The selection can be performed by a user orautomatically.

The user interface can be associated with a native application. When theuser interface is associated with the native application, the processorcan determine multiple categories associated with the user interfaceincluding controls and notifications. The processor can categorize theelement associated with the user interface into a category among themultiple categories. The processor can enable a selection of theappropriate test by presenting the multiple categories, the element, andthe category associated with the element. As mentioned above, theselection can be performed by a user or automatically.

The processor can test the performance of the program. The processor canobtain a program associated with the user interface and the indicationof the appropriate test, where the appropriate test includes the audiotest. The processor can execute the program associated with the userinterface and the audio test. The processor can perform natural languageprocessing on the audio test to determine whether the audio testcorresponds to an output indicated in the appropriate test. Upondetermining that the audio test does not correspond to the outputindicating the appropriate test, the processor can indicate that theprogram did not pass the test.

Computer System

FIG. 6 is a block diagram that illustrates an example of a computersystem 600 in which at least some operations described herein can beimplemented. As shown, the computer system 600 can include: one or moreprocessors 602, main memory 606, non-volatile memory 610, a networkinterface device 612, a video display device 618, an input/output device620, a control device 622 (e.g., keyboard and pointing device), a driveunit 624 that includes a storage medium 626, and a signal generationdevice 630 that are communicatively connected to a bus 616. The bus 616represents one or more physical buses and/or point-to-point connectionsthat are connected by appropriate bridges, adapters, or controllers.Various common components (e.g., cache memory) are omitted from FIG. 6for brevity. Instead, the computer system 600 is intended to illustratea hardware device on which components illustrated or described relativeto the examples of the Figures and any other components described inthis specification can be implemented.

The computer system 600 can take any suitable physical form. Forexample, the computer system 600 can share a similar architecture asthat of a server computer, personal computer (PC), tablet computer,mobile telephone, game console, music player, wearable electronicdevice, network-connected (“smart”) device (e.g., a television or homeassistant device), AR/VR systems (e.g., head-mounted display), or anyelectronic device capable of executing a set of instructions thatspecify action(s) to be taken by the computer system 600. In someimplementations, the computer system 600 can be an embedded computersystem, a system-on-chip (SOC), a single-board computer system (SBC), ora distributed system such as a mesh of computer systems, or it caninclude one or more cloud components in one or more networks. Whereappropriate, one or more computer systems 600 can perform operations inreal time, near real time, or in batch mode.

The network interface device 612 enables the computer system 600 tomediate data in a network 614 with an entity that is external to thecomputer system 600 through any communication protocol supported by thecomputer system 600 and the external entity. Examples of the networkinterface device 612 include a network adapter card, a wireless networkinterface card, a router, an access point, a wireless router, a switch,a multilayer switch, a protocol converter, a gateway, a bridge, a bridgerouter, a hub, a digital media receiver, and/or a repeater, as well asall wireless elements noted herein.

The memory (e.g., main memory 606, non-volatile memory 610,machine-readable medium 626) can be local, remote, or distributed.Although shown as a single medium, the machine-readable medium 626 caninclude multiple media (e.g., a centralized/distributed database and/orassociated caches and servers) that store one or more sets ofinstructions 628. The machine-readable (storage) medium 626 can includeany medium that is capable of storing, encoding, or carrying a set ofinstructions for execution by the computer system 600. Themachine-readable medium 626 can be non-transitory or comprise anon-transitory device. In this context, a non-transitory storage mediumcan include a device that is tangible, meaning that the device has aconcrete physical form, although the device can change its physicalstate. Thus, for example, non-transitory refers to a device remainingtangible despite this change in state.

Although implementations have been described in the context of fullyfunctioning computing devices, the various examples are capable of beingdistributed as a program product in a variety of forms. Examples ofmachine-readable storage media, machine-readable media, orcomputer-readable media include recordable-type media such as volatileand non-volatile memory devices 610, removable flash memory, hard diskdrives, optical disks, and transmission-type media such as digital andanalog communication links.

In general, the routines executed to implement examples herein can beimplemented as part of an operating system or a specific application,component, program, object, module, or sequence of instructions(collectively referred to as “computer programs”). The computer programstypically comprise one or more instructions (e.g., instructions 604,608, 628) set at various times in various memory and storage devices incomputing device(s). When read and executed by the processor 602, theinstruction(s) cause the computer system 600 to perform operations toexecute elements involving the various aspects of the disclosure.

Remarks

The terms “example,” “embodiment,” and “implementation” are usedinterchangeably. For example, references to “one example” or “anexample” in the disclosure can be, but not necessarily are, referencesto the same implementation; and, such references mean at least one ofthe implementations. The appearances of the phrase “in one example” arenot necessarily all referring to the same example, nor are separate oralternative examples mutually exclusive of other examples. A feature,structure, or characteristic described in connection with an example canbe included in another example of the disclosure. Moreover, variousfeatures are described which can be exhibited by some examples and notby others. Similarly, various requirements are described which can berequirements for some examples but no other examples.

The terminology used herein should be interpreted in its broadestreasonable manner, even though it is being used in conjunction withcertain specific examples of the invention. The terms used in thedisclosure generally have their ordinary meanings in the relevanttechnical art, within the context of the disclosure, and in the specificcontext where each term is used. A recital of alternative language orsynonyms does not exclude the use of other synonyms. Specialsignificance should not be placed upon whether or not a term iselaborated or discussed herein. The use of highlighting has no influenceon the scope and meaning of a term. Further, it will be appreciated thatthe same thing can be said in more than one way.

Unless the context clearly requires otherwise, throughout thedescription and the claims, the words “comprise,” “comprising,” and thelike are to be construed in an inclusive sense, as opposed to anexclusive or exhaustive sense; that is to say, in the sense of“including, but not limited to.” As used herein, the terms “connected,”“coupled,” or any variant thereof means any connection or coupling,either direct or indirect, between two or more elements; the coupling orconnection between the elements can be physical, logical, or acombination thereof. Additionally, the words “herein,” “above,” “below,”and words of similar import can refer to this application as a whole andnot to any particular portions of this application. Where contextpermits, words in the above Detailed Description using the singular orplural number may also include the plural or singular numberrespectively. The word “or” in reference to a list of two or more itemscovers all of the following interpretations of the word: any of theitems in the list, all of the items in the list, and any combination ofthe items in the list. The term “module” refers broadly to softwarecomponents, firmware components, and/or hardware components.

While specific examples of technology are described above forillustrative purposes, various equivalent modifications are possiblewithin the scope of the invention, as those skilled in the relevant artwill recognize. For example, while processes or blocks are presented ina given order, alternative implementations can perform routines havingsteps, or employ systems having blocks, in a different order, and someprocesses or blocks may be deleted, moved, added, subdivided, combined,and/or modified to provide alternative or sub-combinations. Each ofthese processes or blocks can be implemented in a variety of differentways. Also, while processes or blocks are at times shown as beingperformed in series, these processes or blocks can instead be performedor implemented in parallel, or can be performed at different times.Further, any specific numbers noted herein are only examples such thatalternative implementations can employ differing values or ranges.

Details of the disclosed implementations can vary considerably inspecific implementations while still being encompassed by the disclosedteachings. As noted above, particular terminology used when describingfeatures or aspects of the invention should not be taken to imply thatthe terminology is being redefined herein to be restricted to anyspecific characteristics, features, or aspects of the invention withwhich that terminology is associated. In general, the terms used in thefollowing claims should not be construed to limit the invention to thespecific examples disclosed herein, unless the above DetailedDescription explicitly defines such terms. Accordingly, the actual scopeof the invention encompasses not only the disclosed examples, but alsoall equivalent ways of practicing or implementing the invention underthe claims. Some alternative implementations can include additionalelements to those implementations described above or include fewerelements.

Any patents and applications and other references noted above, and anythat may be listed in accompanying filing papers, are incorporatedherein by reference in their entireties, except for any subject matterdisclaimers or disavowals, and except to the extent that theincorporated material is inconsistent with the express disclosureherein, in which case the language in this disclosure controls. Aspectsof the invention can be modified to employ the systems, functions, andconcepts of the various references described above to provide yetfurther implementations of the invention.

To reduce the number of claims, certain implementations are presentedbelow in certain claim forms, but the applicant contemplates variousaspects of an invention in other forms. For example, aspects of a claimcan be recited in a means-plus-function form or in other forms, such asbeing embodied in a computer-readable medium. A claim intended to beinterpreted as a means-plus-function claim will use the words “meansfor.” However, the use of the term “for” in any other context is notintended to invoke a similar interpretation. The applicant reserves theright to pursue such additional claim forms in either this applicationor a continuing application.

I claim:
 1. A non-transitory computer-readable medium, storinginstructions, which when executed by a data processor, performsoperations, comprising: obtaining a user selection for identifying anelement associated with a user interface to be tested, wherein a userinterface presentation of the element at least partially fails toprovide information represented by the element to a user with adisability using the user interface; generating an indication userinterface of a test to perform of the user interface presentation of theelement by: indicating an audio test as the test comprising an audio anda function to be performed by the audio, wherein the audio test includesa group, and a state, wherein the group indicates to check whetherselecting a visible label associated with the element selects theelement, and wherein the state indicates to check whether the audiocorrectly describes a selection state associated with the elementincluding selected and unselected; and performing the test on the userinterface; and upon determining that the user interface passed the test,indicating by the indication user interface that the user interface isready for release.
 2. The computer-readable medium of claim 1, whereinthe instructions include: determining a type of program associated withthe user interface; based on the type of program, determining multiplecategories associated with the user interface; categorizing the elementassociated with the user interface into a category among the multiplecategories; and enabling a selection of the test by presenting themultiple categories, the element, and the category associated with theelement.
 3. The computer-readable medium of claim 1, wherein the userinterface is associated with a web page, and wherein the instructionsinclude: determining multiple categories associated with the userinterface including at least four of: HTML, header, navigation, main,form, and footer; categorizing the element associated with the userinterface into a category among the multiple categories; and enabling aselection of the test by presenting the multiple categories, theelement, and the category associated with the element.
 4. Thecomputer-readable medium of claim 1, wherein the user interface isassociated with an application, and wherein the instructions include:determining multiple categories associated with the user interfaceincluding controls and notifications; categorizing the elementassociated with the user interface into a category among the multiplecategories; and enabling a selection of the test by presenting themultiple categories, the element, and the category associated with theelement.
 5. The computer-readable medium of claim 1, wherein the userinterface includes at least two of a graphical user interface, an audiouser interface, or a haptic user interface.
 6. The computer-readablemedium of claim 1, wherein the instructions include: obtaining a programassociated with the user interface and the test, executing the programassociated with the user interface and the audio test; performingnatural language processing on the audio test to determine whether theaudio test corresponds to an output indicated in the test; and upondetermining that the audio test does not correspond to the outputindicating the test, indicating that the program did not pass the test.7. A testing system, comprising: at least one hardware processor; atleast one memory, coupled to the at least one hardware processor, andstoring instructions, which when executed by the at least one hardwareprocessor, implements instructions to: obtain a user selection toidentify an element associated with a user interface to be tested,wherein a user interface presentation of the element at least partiallyfails to provide information represented by the element to a user with adisability using the user interface; generate an indication userinterface of a test to perform of the user interface presentation of theelement by: indicating an audio test as the test comprising an audio anda function to be performed by the audio, wherein the audio test includesa group, and a state, wherein the group indicates to check whetherselecting a visible label associated with the element selects theelement, and wherein the state indicates to check whether the audiocorrectly describes a selection state associated with the elementincluding selected and unselected; and perform the test on the userinterface; and upon determining that the user interface passed the test,indicate by the indication user interface that the user interface isready for release.
 8. The system of claim 7, wherein the instructionsinclude: determining a type of program associated with the userinterface; based on the type of program, determining multiple categoriesassociated with the user interface; categorizing the element associatedwith the user interface into a category among the multiple categories;and enabling a selection of the test by presenting the multiplecategories, the element, and the category associated with the element.9. The system of claim 7, wherein the user interface is associated witha web page, and wherein the instructions include: determining multiplecategories associated with the user interface including at least fourof: HTML, header, navigation, main, form, and footer; categorizing theelement associated with the user interface into a category among themultiple categories; and enabling a selection of the test by presentingthe multiple categories, the element, and the category associated withthe element.
 10. The system of claim 7, wherein the user interface isassociated with an application, and wherein the instructions include:determining multiple categories associated with the user interfaceincluding controls and notifications; categorizing the elementassociated with the user interface into a category among the multiplecategories; and enabling a selection of the test by presenting themultiple categories, the element, and the category associated with theelement.
 11. The system of claim 7, wherein the user interface includesat least two of a graphical user interface, an audio user interface, ora haptic user interface.
 12. The system of claim 7, wherein theinstructions include: obtaining a program associated with the userinterface and the test, executing the program associated with the userinterface and the audio test; performing natural language processing onthe audio test to determine whether the audio test corresponds to anoutput indicated in the test; and upon determining that the audio testdoes not correspond to the output indicating the test, indicating thatthe program did not pass the test.
 13. A method comprising: obtaining auser selection for identifying an element associated with a userinterface to be tested, wherein a user interface presentation of theelement at least partially fails to provide information represented bythe element to a user with a disability using the user interface;generating an indication user interface of a test to perform of the userinterface presentation of the element by: indicating an audio test asthe test comprising an audio and a function to be performed by theaudio, wherein the audio test includes a group, and a state, wherein thegroup indicates to check whether selecting a visible label associatedwith the element selects the element, and wherein the state indicates tocheck whether the audio correctly describes a selection state associatedwith the element including selected and unselected; and performing thetest on the user interface; and upon determining that the user interfacepassed the test, indicating by the indication user interface that theuser interface is ready for release.
 14. The method of claim 13, whereinthe test comprises, when the test requires a keyboard input, indicatinga keyboard key and an effect caused by activating the keyboard key. 15.The method of claim 13, wherein the test comprises, when the testrequires a gesture input, indicating a gesture and an effect caused byperforming the gesture.
 16. The method of claim 13, wherein the testcomprises, testing that the user interface provides desired informationto the user, wherein the user employs a screen reader, a refreshablebraille display, a braille terminal, a speech synthesizer.
 17. Themethod of claim 13, comprising testing that the user interface providesdesired information to the user, wherein the user employs assistivetechnology (AT) interacting with dedicated accessibility APIs oremploying hooking techniques of an operating system.
 18. The method ofclaim 13, comprising testing that the user interface provides desiredinformation to the user via operating system hooking that intercepts anoutput from an application sent to a display and adds audio or Brailleindications of the output.